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Abstract. We show that almost surely the rank of the adjacency matrix 
of the Erdos-Renyi random graph G(n,p) equals the number of non-isolated 
vertices for any clnn/n < p < 1/2, where c is an arbitrary positive constant 
larger than 1/2. In particular the giant component (a.s.) has full rank in this 
range. 



1. Introduction 

Let G be a (simple) graph on n points {1, . . . , n}. The adjacency matrix Qq of G 
is a symmetric n by n matrix, whose ij entry is one if the vertex i is connected to 
the vertex j and zero otherwise. 

The rank of Qq is a basic graph parameter of the graph and has been used in 
graph theory and also in applications from computer science. Thus, it is natural 
and perhaps important to understand the behavior of this parameter with respect 
to a random graph. 

An interesting feature of this parameter is that, unlike most of graph parameters 
(such as the connectivity, the chromatic number etc), the rank is not monotone 
(adding edges may reduce the rank). So, it is not clear that one can define thresh- 
olds. Our study, however, will shed some light on this problem. 

There are several models for random graphs. We will focus on the most popular 
one, the Erdos-Renyi G(n,p) model (some other models will be discussed in the 
concluding remarks). In this model, one starts with the vertex set {1, . . . , n} and 
puts an edge (randomly and independently) between any two distinct vertices i and 
j with probability p. We say that a property P holds almost surely for G(n,p) if 
the probability that G(n,p) possesses P goes to one as n tends to infinity. 

A vertex v of a graph G is isolated if it has no neighbor. If v is isolated, then the 
row corresponding to v in Qq is all-zero. Let i(G) denote the number of isolated 
vertices of G. It is clear that 

Fact 1.1. For any graph G, rank{Qo) < n — i(G). 
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The main result of this paper shows that for p sufficiently large, the above upper 
bound is tight for G(n,p). In other words, any non-trivial linear dependence in 
G(n,p) comes from the isolated vertices. 

Theorem 1.2. Let c be a constant larger than |. Then for any clnn/n < p < ^, 
the following holds with probability 1 — 0((lnlnn) -1 / 4 ) for a random sample G from 
G(n,p): 

rank(G) = n — i{G). 

Remark 1.3. This result is sharp in two ways. Obviously, the estimate 
rank(G) = n — i{G) cannot be improved. Furthermore, the bound c > 1/2 is also 
the best possible. For c < 1/2 and p — c\nn/n, a random sample G from G(n,p) 
satisfies the strict inequality 

rank(G) < n - i(G) 

almost surely. In order to see this, notice that in this range G(n,p) almost surely 
contains two vertices u and v with degree one sharing a common neighbor. The 
rows corresponding to u and v arc not zero, but they are equal and this reduces 
the rank further. 

Let us now deduce a few corollaries. It is well known that p > clnn/n for some 
c > 1/2, then the random graph (a.s.) consists of a giant component and some 
isolated vertices. 

Corollary 1.4. Let c be a constant larger than i. Then for any clnn/n < p < 1/2, 
the giant component ofG(n,p) is almost surely non-singular. 

Furthermore, if c > 1, then G(n,p) almost surely is connected and contains no 
isolated vertices. 

Corollary 1.5. Let c be a constant larger than 1. Then for any c In n/n < p < 1/2, 
G(n 7 p) is almost surely non-singular. 

It follows that the non-singularity of G(n,p) has a sharp threshold at p — In n/n. 
For any positive constant e, if p < (1 — e) In n/n, then G(n,p) is almost surely 
singular as it contains isolated vertices. On the other hand, the above corollary 
asserts that G(n,p) is almost surely non-singular for p > (1 + e) lnn/n. 

The special case p = 1/2 was a well known conjecture of B. Weiss, posed many years 
ago. This special case can be viewed as the symmetric version of a well-known the- 
orem of Komlos on the non-singularity of (non-symmetric) random Bernoulli ma- 
trices [4] and was solved two years ago in [1]. The proof of Theorem 1.2 extends the 
ideas in that paper and combines them with arguments involving random graphs. 
In the next section, we outline this proof and present the key lemmas. In Section 
3, we prove the theorem assuming the lemmas. Most of the rest of the paper is 
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devoted to the proofs of the lemmas. The last section contains several remarks and 
open questions. 

Notation. In the whole paper, we will always assume that n is sufficiently large. 
As usual, the asymptotic notation is used under the condition that n — > oo. P 
denotes probability and E denotes expectation. 

2. Outline of the proof and the main lemmas 

We assume that o(l) = p > clnn/n for a constant c > 1/2. The treatment of the 
case p = f2(l) was presented in [1] and is omitted here. We denote by Q(n,p) the 
adjacency matrix of G(n,p). 

Following the ideas from [4, 1], we are going to expose Q(n,p) minor by minor. 
Letting Q m denote the upper left m x m minor of Q(n,p), we view Q m +i as 
being formed by taking Q m and augmenting by a column whose entries are chosen 
independently, along with the column's transpose. Denote by G m the graph whose 
adjacency matrix is Q m . In graph theoretic terms, we are considering the vertex 
exposure process of G(n,p). 

Our starting observation is that when a good portion of the vertices have been 
exposed, the matrix has rank close to its size. 

Recall that p > clnn/n for a constant c > 1/2. We can set a constant < S < 1 
such that 1/2 < 5c < 3/5. Define n' := Sn. 

Lemma 2.1. For any constant e > there exists a constant 7 > such that 
P(rank(Q n ,) < (1 - e)n') = o{e^ nlnn ) 

Our plan is to show that the addition of the remaining n — n' rows/columns is 
enough to remove all the linear dependencies from Q n i , except those corresponding 
to the isolated vertices. 

The next lemmas provide some properties of G m for n' < m < n. 

Definition 2.2. A graph G is well-separated if it contains no pair of vertices of 
degree at most In Inn whose distance from each other is at most 2. 

Lemma 2.3. For any constant e > 0, G m is well separated for every m between n' 
and n with probability 1 — 0(n 1 ~ 2cS+e ). 

Here and later on, we always choose e sufficiently small so that 1 — 2cS+e is negative. 

Definition 2.4. A graph G is a small set expander if every subset S of the 
vertices of G with l^l < ln3 " 2 - containing no isolated vertices has at least \S\ edges 
connecting S to S, its complement. 

Lemma 2.5. For any m > n' the probability that G m is well separated but is not 
a small set expander is 0(l/n 3 ). 
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Remark 2.6. Lemmas 2.3 and 2.5 immediately imply that almost every graph en- 
countered in our process after time n' will be a small set expander. We cannot 
expect this to occur for p < ^ 5 ~~^ ln " , as at this density the random graph will 
likely contain pairs of adjacent vertices of degree 1, leading to sets of size 2 without 
any edges at all leaving them. 

Definition 2.7. A set S of the vertices of a graph G is nice if there are at least 
two vertices of G each is adjacent to exactly one vertex in S. 



Set k := 1=£2 

Definition 2.8. A graph G is good if the following two properties hold: 

1 . Every subset of the vertices of G of size at least 2 and at most k which contains 
no isolated vertices is nice. 

2. At most -pj^ vertices of G have degree less than 2. 

A symmetric (0,1) matrix A is good if the graph for which it is an adjacency matrix 
is good. 



The next lemma states that in the augmentation process we will likely run only 
into good matrices. 

Lemma 2.9. Let e be a positive constant. Then with probability 1 — 0(n 1 ~ 2cS+e ), 
Q m is good for every m between n! and n . 



Finally, we have two lemmas stating that good matrices behave nicely when aug- 
mented. 

Definition 2.10. A pair (A, A') of matrices is called normal if A' is an augmen- 
tation of A and every row of all 0's in A also contains only 0's in A' (in graph 
theoretic terms, the new vertex added by the augmentation is not adjacent to any 
vertices which were isolated before the augmentation). 

Lemma 2.11. Let A be any fixed, good m x m matrix with the property that 
rank(A) + i(A) < m. Then 

P( rank(Q m+1 ) - rank(Q m ) < 2\(Q m ,Q m+1 ) is normal A Q m = A) = 0((kp)~ 1/2 ). 

Lemma 2.12. Let A be any fixed, good, m x m matrix with the property that 
rank(A) + i(A) = m. Then 

P( rank(Q m+ i) - rank(Q m ) < l|(Q m ,Q m +i) is normal A Q m = A) = 0((fcp) _1/4 ). 



Set Y m = m — (rank(Q m ) + i(Q m j). The above two lemmas force Y m to stay near 
0. Indeed, if Y m is positive, then when the matrix is augmented, m increases by 1 
but the rank of Q m will likely increase by 2 (notice that kp — ► oo), reducing Y m . 
On the other hand, if Y m = 0, it is likely to stay the same after the augmentation. 

In the next section, we will turn this heuristic into a rigorous calculation and prove 
Theorem 1.2, assuming the lemmas. 



THE RANK OF RANDOM GRAPHS 



5 



3. Proof of the Main Result from the Lemmas 



In this section, we assume all lemmas are true. We are going to use a variant of an 
argument from [1]. 

Let B\ be the event that the rank of Q n > is at least n'(l — ^f-)- Let £? 2 be the 
event that Q m is good for all n' < m < n. By Bayes' theorem we have 

P( rank(Q„) + i(Q n ) < n) < P( rank(Q„) + »(Q„) < n A B 2 |Bi) + P(-Bi) + P(-B 2 ) 

By Lemma 2.1 we have that P(^Bi) = o(e-'< nlnn ) and by Lemma 2.9 P(^B 2 ) = 0{n 1 - 2cS+t ). 
Both probabilities are thus much smaller than the bound (^((lnlnn)" 1 / 4 ) which we 
are trying to prove. So, it remains to bound the first term. 

Let Y m = m — rank(Q m ) — i(Q m ). Define a random variable X m as follows: 

• X m = 4 Ym if Y m > and every Qj with n' < j < m is good; 

• X m = otherwise. 

The core of the proof is the following bound on the expectation of X m+ i given any 
fixed sequence Q m of matrices {Q n >, Qn'+i, ■ ■ ■ , Qm} encountered in the augmen- 
tation process. 

Lemma 3.1. For any sequence Q m = {Q n >, Q n '+i, ■ ■ ■ , Qm} encountered in the 
augmentation process, 

E(X ro+1 |Q m ) < ^X m + 0((lnlnn)- 1 / 4 ). 
5 

Let us (for now) assume Lemma 3.1 to be true. This lemma together with Bayes 
theorem shows that for n' < m we have 

V(X m+1 \Q n ,) < ^E(X m |Q„0 + O((lnlnn)- 1 / 4 ). 

By induction on to 2 — mi we now have that for any m 2 > mi > n' 

E(X m2 |Q„0 < (^) m2 - mi E(X mi |Q„0 + O((lnlnn)- 1 /4). 

In particular, by taking m 2 = n and mj = n' we get that 

E(X„|Q„0 < (^) n - n 'X n , +0((lnlnn)- 1 /4 ) _ 

If Q n > satisfies B\, we automatically have X n i < = (^/2) n ~ n , so 

E(X n |Q n ,) < (^)"-"' +0((lnlnn)- 1 / 4 ) =0((lnlnn)- 1 /4). 
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By Markov's inequality, for any Q n > satisfying B\ 

P{X n >3|Q„0 -©((lnlnn)- 1 / 4 ) 

On the other hand, by definition X n > 4 if rank(Q„) + i(Q n ) < n and B 2 holds. It 
thus follows by summing over all Q n > satisfying B\ that 



P( rank(Q„) + i{Q n ) < n A B^) = O^lnlnn)- 1 / 4 ), 
proving the theorem. 

It remains to prove Lemma 3.1. If a matrix in the sequence Q m = {Q n ',Qn'+i, ■ ■ ■ ,Qm} 
is not good, then X m+ i = by definition and there is nothing to prove. Thus, from 
now on we can assume that all matrices in the sequence are good. 

Let Z m denote the number of vertices which were isolated in Q m but not in Q m +\- 
If Z m is positive, then augmenting the matrix will increase Y m by at most Z m + 1 
(to increases by 1, the number of isolated vertices decreases by at most Z m , and 
the rank docs not decrease). Furthermore, Z m = if and only if (Q m ,Qm+i) is 
normal. 

By Bayes' theorem, we have 

E(X m+1 \Q rn ) = E{X m+1 \Z m > A Q m )P(Z m > 0\Q m ) 

+ F,(X m+1 \Q m A (Q m ,Q m +i) is normal)P((Q m ,(5 m+ i) is normal|Q m ) 

< E(X m+ i X (Z m > 0)\Qm) + E(X m+1 \Q m A {Q m , Q m +i) is normal) 

= E(4 z ™ +1 + Y ™ x (Z m > 0)\Q m ) + V{X m+1 \Q m A (Q m ,Q m +i) is normal). 

Since Q m is good, G m has at most isolated vertices. Thus, we can bound Z m 
by the sum of —r— random Bernoulli variables, each of which is 1 with probability 

J p Inn ' t- j 

p. It follows that 

nz m = i)<^fy<{innr. 

Adding up over all i 1 we have 

oo 

E (4^+i x (Z m > 0)|Q m ) < ^4 J+1 (lnn)- 4 = (^((lnn)- 1 ). 

i=i 

If Y m — and (Q m , Qm+i) is normal, then by Lemma 2.12 (which applies since Q m 
is good) X m+ i is either or 4, with the probability of the latter being (3((ln lnn)^ 1 / 4 ). 
Therefore we have for any sequence Q m = {Q n >, ■ ■ -Qm} of good matrices with 
Y m = that 

nX m+1 \Q m ) - Oainlnn)- 1 / 4 + (Inn)- 1 ) = (^(lnfon)- 1 / 4 ). (1) 
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If Y m = j > and (Q m , Q m +i) is normal, then Y m +i is j — 1 with probability 

1 — 0((lnlnn) -1 / 2 ) by Lemma 2.11, and otherwise is at most j + 1. Combining 

this with the bound on E(4 Zm+1 x(Z m > 0)|Q m ) we have 



E(X m+1 \Q m ) = 4P- 1 +4^+ 1 0((lnlnn)- 1 / 2 ) + 4^0((ln n)" 1 ) < |# (2) 
The lemma now follows immediately from (1) and (2). 

4. Proof of Lemma 2.1 
By symmetry and the union bound 

P(rank(Q n ,) < (1 - e)n') < (j^j x P(B*), 

where B{ denotes the event that the last en' columns of Q' n are contained in the 
span of the remaining columns. 

We view Q n ' as a block matrix, 



Qn' 



A 


B ' 


[ B ' r 


C 



where A is the upper left (1 — e)n' x (1 — e)n' sub-matrix and C has dimension 
en' x en'. We obtain an upper bound on P(B*) by bounding the probability of Bjf 
conditioned on any fixed A and B (treating C as random). 

B\ cannot hold unless the columns of B are contained in the span of those of A, 
meaning the equation B = AF holds for some matrix F. If this is the case, then 
B\ will hold only when we also have C = B T F. This means that each entry of C 
is forced by our choice of A, B and our assumption that B^ holds. 

However, C is still random, and the probability that any given entry takes on its 
forced value is at most 1 — p. The entries are not all independent (due to the 
symmetry of C), but those on or above the main diagonal are. Therefore the 

probability that B\ holds for any fixed A and B is at most (1 — p) 2 . 
We therefore have 



P(rank(Q„0 < (1 - e)n') < 

< 
< 



en' 



, ((1-P) 



„') 2 



ne / -p(eV) 2 
— e ^ 
en' 

^n^-cin In n 
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where ci and C2 are positive constants depending on e, 5, and c (but independent 
of n). 

Remark 4.1. The same argument gives an upper bound of c 2 "e~ Cl " 2p on the prob- 
ability for any n and p. Holding e fixed, we see that the probability becomes o(l) 
for p = y/n with sufficiently large fixed y. In particular, if p —* and np — > oo, 
then rank(<5n, p )/n — > 1. 



5. Proof of Lemma 2.3 



If p is at least (In n) 2 /n' then G(m,p) will with probability at least 1 — o(l/n 3 ) 
have no vertices with degree at most In Inn, in which case the lemma is trivially 
true. Therefore we can assume p < (In n) 2 /n' 

If G m fails to be well separated for some to between n' and n there must be a first 
mo with this property. We are going to bound the probability that a fixed m is mo- 

Case 1: too = n'. The probability that G' n fails to be well-separated is at most n 2 
times the probability that any particular pair of vertices v and w are both of small 
degree and at distance at most 2 from each other. 

The probability that v has sufficiently small degree is at most 

(In n) 



lnln n ^ ' 1 \ , l 1^J l , rinr)l lnln ' 

£ , -*>)"'-*< (i + o(i)) £>'„)'(! <^r~ 



=0 V * / *=o n ' (3) 



and the same holds for w even if we assume v has small degree (although the degree 
of v and that of w aren't quite independent, we can bound the probability w has 
small degree by the probability it has at most In In n neighbors not including v) . 

Since a given pair of vertices both being of small degree is a monotone decreasing 
graph property, the FKG inequality gives that the probability of an edge being 
present between v and w is at most p even after we condition on them both being 
small degree. Similarly, the probability of the existence of an x adjacent to both v 
and w is at most np 2 . Combining these facts, the probability of two small degree 
vertices being close is at most 

n nn )inin„ (lnn) 21nln "ln 4 n i_ 2c5+e , 
" ( - „cs ) (P + n P ) ^ ~ 7^S=i = °(» )• 

Case 2: nl < too = m < n. We bound the probability that m satisfies the somewhat 
weaker condition that G m -\ is well separated but G m is not. Let w be the vertex 
newly added to the graph. There are only two ways that the addition of w can 
cause G m to lose well-separatedness: either w serves as the link between two low 
degree vertices v\ and i>2 that were previously unconnected, or w is itself a low 
degree vertex of distance at most 2 from a previous low degree vertex vq. 
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Applying (3) twice, the probability that any particular v\ and V2 both have low 
degree and w is connected to both of them is at most p 2 ( ^ ln n ^ ci ) 2 . 

Again by (3) the probability that w is of low degree is at most ( - ln n ^ ci , as is 

the probability that any particular choice of candidate for vq has low degree. By 
the FKG inequality, the probability that w and our candidate vq share a common 
neighbor given they both have small degree is at most np 2 . 

Since there are at most n 2 choices for v\ and v-i and at most n choices for vo, 
applying the union bound over all these choices, we obtain that the probability 
G m _i is well connected but G m is not is at most 

( (lnn J'; ln " ) 2 (2nV) = o(n- 2 ^) 

Applying the union bound over all possible to (there are at most n values for to), 
we obtain that the probability of the existence of such an m is o(n 1 ~ 2cS+e ). The 
proof is complete. 



6. Proof of Lemma 2.5 

In order to prove the edge expansion property we first show that almost surely all 
small subgraphs of G(n, c1 ^" ) will be very sparse. 

Lemma 6.1. For fixed c the probability that G(n, has a subgraph containing 

at most ln3 " 2 n vertices with average degree at least 8 is 0(n~ 4 ) 

Proof Let qj be the probability that a subset of size j has at least Aj edges. By 
the union bound, this is at most (™) times the probability of a particular subset 
having at least Aj edges, so 

• <- 

ne ■ ejclnn 4 - 

< ( ce 5 j 3 ln 4 n 
n 3 

For j < n 1 / 4 this gives qj < n~ 2j , while for j > n 1 / 4 we have (using our upper 
bound on j) qj < (Inn) - - 7 / 2 = o(n~ 5 ). By adding up over all j at least 2, we can 
conclude that the failure probability is 0(n~ 4 ), completing the proof. ■ 
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Armed with this lemma we can now prove Lemma 2.5; we do so in two cases 
depending on the value of p. 



Case 1: p < 



12 In n . 



Suppose that G failed to expand properly. If this is the case, there must be a mini- 
mal subset So with fewer than |So| edges leaving it. If any vertex in So were adjacent 
to no other vertex in So, it would have a neighbor outside So (since So contains 
no isolated vertices), and dropping it would lead to a smaller non-expanding set, 
a contradiction. Therefore every vertex in So has a neighbor in So- By the well- 
separatedness assumption the vertices of degree at most In Inn are non- adjacent 
and share no common neighbors. Thus it follows that at most half the vertices 
in S are of degree at most In Inn. Since at most |S | edges leave S , it follows 
that there are 0(|So| In Inn) edges between vertices of So- But by Lemma 6.1 the 
probability an So with this many edges exists is 0(-^), competing the proof for 
this case. 

Case 2: p > 12 We estimate the probability that there is a non-expanding 
small set directly by using the union bound over all sets of size i < n ln~ 3 / 2 n. The 
probability in question can be bounded from above by 



i—1 \ / V / i=l 



)) 



1 nln~ 3/2 n 

= _ ^ {n 2 e- np< - 1+ °^y. 



i=l 



The lower bound on p guarantees that the summand is 0(n~^ 4+ °^ z ), so the prob- 
ability for any p in this range is o(^). Notice that in this case we do not need the 
well-separatedness assumption. 



7. Proof of Lemma 2.9 



Let Co be the event that G m is good for every m between n' and n. Let C\ be the 
event that G m has at most j^-^ vertices of degree less than 2 for every m between 
n' and n, C2 be the event that G m has maximum degree at most lOnp for each m, 
and C3 be the event that G m is well separated and a small set expander for every 
m between n' and n. We have 

P(-Co) < P(-.C AC1AC2A C 3 ) + P(-Ci) + P(-.C 2 ) + P(-.C 3 ). 

It suffices to bound each term on the right hand side separately, and we will do so 
in reverse order. 



Lemmas 2.5 and 2.3 together show that P(^C 3 ) = C^n 1 - 2 ^ 6 ). 
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P(^C2) is at most the expected number of vertices of degree at least lOnp in G. 
which is at most 



n 



(lOr ^j pWnP - n(e/10p) Wnp p Wnp < ne- Wnp = o(n" 4 ). 



To bound P(-iCi), we note that he probability that some G m contains a set of 
vertices of degree less than 2 of size s = p^ 1 In -1 n is bounded from above by the 
probability that at least s vertices in G n each have fewer than 2 neighbors amongst 
the vertices of G n > , which by Markov's inequality is at most 

-n{n'p{\ - p) n '- 2 + (1 - p) 11 '- 1 ) = 0(sn 2 pe- Snp = o^ 1 / 3 ). 



It follows by the union bound that P(-Ci) = o(n- 1 / 3 ). 

It remains to estimate the first term, which we will do by the union bound over 
all m. Since property C\ implies G m has few vertices of small degree, it suffices to 
estimate the probability G m contains a non-nice set while still satisfying properties 
C\, C 2 , and C 3 . 

Let pj be the probability that conditions C\ , Ci , and C3 hold but some subset of 
at most j vertices without isolated vertices is not nice. Symmetry and the union 
bound give that pj is at most (™) times the probability that the three conditions 
hold and some fixed set S of j vertices is not nice. We will do this in three cases 
depending on the size of j. 

Case 1: — I— < j < k. 

pvln n 

Direct computation of the probability that a fixed set of j vertices has either or 
1 vertices adjacent to exactly one vertex in the set gives: 



Pj 



< 



l ) ((l - j P (i - py-'r + mj P {i - P y-\i - mi - py-'r- 1 ) 



m 

j . 

< (mepVi^y ((1 - jp{i - py- 1 )" 1 + mj P (i - py-^i - j P (i - py- 1 )™- 1 ) 

< (mepVh^) j {{l - j pe -Mi+o(i)))m + m jp e -^ 1+0 ^{l - j pe -Mi+o(i)))™-i) 

< (m e p^)^( e - m ^ 1 +°( 1 )) e ^ P(1+0<1)) )(l + m jpe-^ 1+0 ^). 



It follows from our bounds on j and p that mjpe ■ 7P tends to infinity, so the second 
half dominates the last term of the above sum and we have: 



Pj < (mepVln^^( e - m ^ 1 + ( 1 )) e ^ P(1+0<1)) )(2mjp e -^ 1 + ( 1 ))). 
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Taking logs and using Sn < m < n gives: 

HPj) < (1 + o(l))j(ln(enpVinn) - (fope"^ 1 ^ 1 " - p + i^M; 
< (1 + o(l))j(41n(np) - ^e-^f 1 ^ 1 ") 

(In n) 



= (l + (l))j(41n(np)- 3 T ). 



Since np > 0.5 Inn, taking n large gives that the probability of failure for any 
particular j in this range is o(l/n 4 ), and adding up over all j and m gives that the 
probability of a failure in this range is o(l/n 2 ). 



Case 2: <j< — 4=. 



Let 6 be the number of vertices outside S adjacent to at least one vertex in S, and 
let a be the number of edges between S and the vertices of G outside S. If G m is 
to satisfy the properties Ci and C3 it must be true that j < a < Wjnp. 

Next, we note that if S is not nice, then at least b — 1 of the neighbors of S must 
be adjacent to at least two vertices in S. This implies that b < ^5— . It follows that 



L ?(b<^-\a = w)+I>(a<Wj) max P(6 < — -|a = 

\J J W]<w<10]np Z j<w<10] Z 



To bound P(6 < 2i! ^|a = to), we fix a set of vertices and bound the probability 
that w consequentially randomly selected vertices were in that set. Using the union 
bound over all possible sets of vertices, we have 



< 



< 



< 



w± 
2 



2(m - j) ' 



( gg(m-j) «gi w 



w 
4w 
m 



2(m- j) 



This last bound is decreasing in uu for the entire range under consideration (our 
bounds on j guarantee w is at most ^= ) . Therefore we can plug in the smallest 
values of w in (4) to get 
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< 3^f m )(P(«<10,)(^)U(^) 

\j J m m 

< 3 ^ { ™y { p {a<m{ e l)i +{ ^) 

< 3V^(P(a<10j)(^)^ + (^) 4j ). 

J n 



a here is the sum of at least n'j(l + o(l)) independent Bernoulli variables each with 
probability of success at least We therefore have 

P(a<10j) < ^f n ^(£^)*(l-£^)»'J(i-H<i)) 



j=0 



= n 
yielding 



4=0 

jc5(l+o(l)) 



P ^ 4 ^ (( VJn(^-V 2 ) J + ^) 4j )- 



Both terms are decreasing in j in the range under consideration, and plugging in 
the lower endpoint of our range gives that pj = o(l/n ) for each j and m in the 
range. By the union bound the probability of failure in this range is o(l/n 2 ). 

Case 3: 2 < j < 

Let a and b be as in case 2. We again bound the probability of failure for any fixed 
set of vertices by the probability that b < ^±1. . 

We first note that if condition C3 is to be satisfied then this inequality must also 
be satisfied any time a is at most 10 j. This is because if a is in this range it follows 
that every vertex in our set is of degree below In Inn, and the well-separatedness 
condition then guarantees that each edge leaving our set must go to a different 
vertex. 

Because of this, we can rewrite equation (4) as 



( m \ I T-»/J w + 1 . . 

p, < ( max P(o < a — w) 

\ j J Wj<w<Wjnp 2 

< 4VS(^)« 
n 
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where the second inequality comes from our computations in case 2. Adding up 
over all j in this range gives that the probability of failure in this range is o(n~ 3 ). 

8. Some Littlewood-Offord-Type Results 

The proof of the remaining two lemmas rely on modifications to the following lemma 
of Littlcwood and Offord [2]: 

Lemma 8.1. Let cii be fixed constants, at least q of which are nonzero. Let 
Zi, Z2, . ■ . z n be random, independent Bernoulli variables which take on and 1 
each with probability 1/2. Then for any fixed c, 

n 

P(£a i z i = c) = 0(q- 1 / 2 ), 
»=i 

where the implied constant is independent of n, the ai, and c. 

The variables we are now considering, however, are not equally likely to be and 
1. Thus we need the following special case of a more general result in [3]. 

Lemma 8.2. Let ai be fixed constants, at least q of which are nonzero. Let 
zi, Z2, ■ ■ ■ z n be random, independent Bernoulli variables which take on 1 with prob- 
ability p < |, with probability 1 — p. Then for any fixed c, 

n 

P(J2^z l = c) = 0((qp)-^ 2 ), 
i=l 

where the implied constant is absolute. 

Remark 8.3. The theorem is also true (with near identical proof) if one replaces the 
distribution of the Zi by one with P(zi = 1) = P(zi = — 1) = p, P(zi = 0) = 1 — 2p 

Proof 

Let ri be Bernoulli random variables taking on 1 with probability 2p, and with 
probability 1 — 2p. Let s, be random variables taking on 1 and with equal 
probability, and replace z, by r,Sj (which has the same distribution). By Bayes' 
inequality we have 

n n n n 

P{^2(a l r i )s l = c) < P(^ a i r i s l = c\ ^ r * > QP) + P (52 Ti K qp ^ 

i—l i—1 i—1 i—1 

Since E(X^=i r i) = %QP an d ^ ar (S"=i r i) — %qp, by Chebyshev's inequality the 
second term on the right is 0{(qp)~ x / 2 ). In the first term there are at least qp 
nonzero a^i, so the bound follows immediately from the original Littlcwood-Offord 
lemma. ■ 
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The other modified Littlcwood-Offord result we need is a similar modification of 
the Quadratic Littlewood-Offord lemma in [1]: 

Lemma 8.4. Let a-ij be fixed constants such that there are at least q indices j such 
that for each j there are at least d indices i for which ^ 0. Let Z\, z 2 , ■ ■ ■ z n be 
as in Lemma 8.2. Then for any fixed c 

n n 

p(EE fl «i^ = c ) = %f)" 1/4 )> ( 5 ) 
»=i j=i 

where the implied constant is absolute. 
Proof 

The proof of Lemma 8.4 relies on the use of the following application of the Cauchy- 
Schwartz inequality: 

Lemma 8.5. Let X and Y be random variables, and let E(X, Y) be an event 
depending on X and Y . Let X' be an independent copy of X . Then 

P(E(X,Y)) < (P(E(X,Y) AE(X',Y))^ 2 

Proof 

By discretizing, we can reduce to the case when X takes a finite number of values 
xi,...,x n . From Bayes' identity we have 

n 

P(E(X, Y)) = P(E(x u Y))P(X = Xi ) 

i=l 

and 

n 

P(E(X, Y) A E(X, Y')) = J2 P(E(xi,Y)fP(X = Xi ), 

and the result follows immediately from Cauchy-Schwartz. ■ 

Without loss of generality we can assume that the q indices j given in the assump- 
tions of our lemma are 1 < j < q. 

Define X :— (zi) i>q / 2 , Y :— (zi) i<q / 2 . Let Q(X,Y) be the quadratic form in (5), 
and let E(X, Y) be the event that that form is 0. By Lemma 8.5 we have 

P(Q(X, Y) = cf < P(Q(X, Y) - Q(X', Y) = c) < P(Q(X, Y) - Q(X', Y) = 0) 

Thus it is enough to show the right hand side of this is 0((gp) -1 / 2 ). To estimate 
the right hand side, we note that 

Q(X,Y)-Q(X',Y)= W jZj + f(X,X'), 
j<q/2 
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where 

W j = E a ij( z i~ z i) 
l>q/2 

and / is a quadratic form independent of Y. As in Lemma 8.2, we next use Bayes' 
inequality to condition on the number of nonzero Wj. Let Ij be the indicator 
variable of the event Wj =0. We have that 

P(E W jZj ) = -f(X,X')) 

j<d/2 

<p(E w i*i) = -k x > x ')\ E h < 9/4) + p(E h ^ 9/4) 

j<d/2 



By Lemma 8.2 the first term on the right hand side is 0((qp) 1 ^ 2 ) for any fixed 
value of A, so it immediately follows that the same holds true for X random. 

For the second term, we note that since Y only involves q/2 indices, each index in 
Y must have at least q/2 indices in X with a,j ^ 0. If follows from the remark 
following Lemma 8.2 that E(Ij) = 0((gp)" 1/2 ), so = 0(q(qp)- 1/2 ). By 

Markov's inequality, the second term is also 0((qp) -1 / 2 ), and we are done. ■ 



9. Proofs of lemmas 2.11 and 2.12 



The assumption that the pair (Q m ,Q m +i) is normal means that the rows in Q m 
which arc entirely have no bearing on the rank of Q m +i- Thus without loss of 
generality we can drop those rows/columns and assume that A has no rows which 
are all 0, at which point A will still be singular in Lemma 2.11, but will have become 
nonsingular in Lemma 2.12. 

Proof of Lemma 2.11. 

If the new column is independent from the columns of A, then the rank increases by 
two after the augmentation (since A is singular and the matrices are symmetric). 
Thus if the rank fails to increase by two then adding a new column does not increase 
the rank. 

Assume, without loss of generality, that the rank of A is D and the first D rows 
Xi, . . . ,Xd of A are independent. Then the last row x m can be written as a linear 
combination of A in a unique way 



D 



E a 
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By throwing away those a% which are zero, we can assume that there is some D' < D 
such that 



D 1 



where all m ^ 0. If D' + 1 < k, then there is a vertex j which is adjacent to exactly 
one vertex from S = {1, . . . , D' , to}, thanks to the goodness of Q m . But this is a 
contradiction as the jth coordinates of x m and Y^i=i a i x i d° n °t match (exactly 
one of them is zero) . Thus we can assume that D' > k — 1. 

Now look at the new column (yi, ■ ■ ■ ,y m ). Since the rank does not increase, we 
should have 



D' 

x m ^ ' 

i=l 



where x[ is the extension of Xi. This implies 



D' 

y m = ^aiyi. 

i=l 

Since all ai are non zero, by Lemma 8.2 the probability that this happens is 
OiiDp)- 1 / 2 ) = Oiikp)- 1 ' 2 ), concluding the proof. 

Proof of Lemma 2.12. Let A be a good non-singular symmetric matrix of order 
to. Let A' be the m + 1 be m + 1 symmetric matrix obtained from A by adding a 
new random (0, 1) column u of length to+ 1 as the to+ 1st column and its transpose 
as the to + 1st row. 

Let Xi, . . . , x m+ i be the coordinates of u; x m+ i is the lower-right diagonal entry of 
A' and is zero. The determinant for A' can be expressed as 



mm mm 

(det j4)x m ^i -\- ^ y ^ ^ CijXiXj — ^ ^ ^ ^ CijXiXj = Q 

i—l j — 1 i—1 j — 1 

where Cjj is the ij cofactor of A. It suffices to bound the probability that Q = 0. 
We can do this using Lemma 8.4 if we can show that many of the Cy are nonzero. 



Since A is now nonsingular, dropping any of the columns of A will lead to a to x to— 1 
matrix whose rows admit (up to scaling) precisely one nontrivial linear combination 
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equal to 0. If any of the rows in that combination are dropped, we will be left with 
anra-lxm-1 nonsingular matrix, i.e. a nonzero cofactor. 

As above, that combination cannot involve between 2 and k rows (since A is good, 
any set of between 2 and k rows has at least two columns with exactly one nonzero 
entry, and even after a column is removed there will still be one left). The com- 
bination will involve exactly 1 row only when the column removed corresponds to 
the only neighbor of a degree 1 vertex (which becomes isolated upon the removal 
of its neighbor). But by assumption there are only -p^tn = ^(l^n) possibilities for 
such a neighbor. 

It follows that for each index j except at most 0(^— ) indices there are at least 
k indices i for which Cjj ^ 0, and we can therefore apply Lemma 8.4 with q = k 
to get that P(Q = 0) and P(Q = — dct-A) are both 0((pfc)~ 1//4 ), proving Lemma 
2.12. 



10. Open Problems and Avenues for Further Research 

Theorem 1.5 gives that — is a threshold for the singularity of Q(n,p) but it would 
still be of interest to describe the sources of singularity once p drops below the 
threshhold. For instance, Theorem 1.2 states that for p > \nn/2n almost surely 
the only cause of singularity is the presence of isolated vertices. However, once p 
drops below this lnn/2n this will no longer be the case, as G(n,p) will begin to 
acquire pairs of degree one vertices having a common neighbor (corresponding to 
pairs of equal rows in Q(n,p j). 

As noted in remark 4.1, it is still the case at this point that rank(<5„ jP )/n — > 1, 
and this will continue to occur until pn = O(l). For y fixed and p = y/n, we have 
from consideration of isolated vertices and the bounds in Lemma 2.1 that 



1 - OQny/y) < (1 + o(l)) rank(Q„, p )/n < 1 - e~ y = 1 - i(G)/n 

It seems likely that E( rank(Q„ . y / n ))/ n tends to some function g(y) as n — > oo, 
and it would be of interest to compute g. Azuma's inequality applied to the vertex 
exposure process guarantees the ratio is highly concentrated around this g, whatever 
it may be. 

Let us now consider the case when p is above the threshold In n/n. What is the 
probability that Q(n,p) is singular ? The current proof gives bounds which tends 
to zero rather slowly. For p > n~ a we can prove the singularity probability is 
0(n _1 / 4 ( 1_2a )). However, forp < n -1 / 2 , we can only prove 0((lnlnn) -1 / 4 ). While 
it is certain that these bounds can be improved by tightening the arguments, it is 
not clear how to obtain a significant improvement. For instance, we conjecture 
that in the case p = 6(1), the singular probability is exponentially small. Such 



THE RANK OF RANDOM GRAPHS 



19 



bounds are known for non-symmetric random matrices [5, 6, 7], but the proofs do 
not extend for the symmetric case. 

We think that Corollary 1.5 is relatively easy to extend to models of random graphs 
where the edges are independent, but are included with different probabilities. 
However, the assumption of independence between the edges of G seems crucial. 
In particular, the results in this paper do not yet apply to the model of random 
regular graphs. 

Question 10.1. For what d will the adjacency matrix of the random d-regular 
graph on n vertices almost surely be nonsingular? 

For d = 1, the matrix is trivially non-singular. For d = 2, the graph is union of 
cycles and the matrix will almost surely be singular (any cycle of length a multiple 
of 4 leads to a singular adjacency matrix). We conjecture that for 3 < d < n — 4, 
the matrix is again almost surely nonsingular. 
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